Gated Boltzmann Machine for Recognition under Occlusion

نویسنده

  • Charlie Tang
چکیده

Unconstrained real world environments are often full of clutter. Therefore, robustness to occlusion is vital for any artificial recognition system. Recently, the Deep Boltzmann Machine (DBM) has been shown to be good at generative modeling and recognition of visual objects. In this work we develop an extension to the DBM framework to make the system more accurate when recognizing handwritten digits under partial occlusion. The key is the introduction of additional indicator random variables which specify where in the image to ignore the occluder. The new model is still a Boltzmann machine as some extra terms are added to the DBM energy function. During inference, the model tries to figure out what are the occluder and the “object” given an occluded image. In addition, we can easily transfer the learned occluder model to other DBMs learned on different types of data, e.g. faces or objects.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Discriminate in the Wild: Representation-Learning Network for Nuisance-Invariant Image Comparison

We test the hypothesis that a representation-learning architecture can train away the nuisance variability present in images, owing to noise and changes of viewpoint and illumination. First, we establish the simplest possible classification task, a binary classification with no intrinsic variability, which amounts to the determination of co-visibility from different images of the same underlyin...

متن کامل

Classification Factored Gated Restricted Boltzmann Machine

Factored gated restricted Boltzmann machine is a generative model, which capable to extract the transformation from an image pair. We extend this model by adding discriminative component, which allows directly use this model as a classifier, instead of using the hidden unit responses as features for another learning algorithm. To evaluate the capabilities of this model, we have created a synthe...

متن کامل

Gated Boltzmann Machine in Texture Modeling

In this paper, we consider the problem of modeling complex texture information using undirected probabilistic graphical models. Texture is a special type of data that one can better understand by considering its local structure. For that purpose, we propose a convolutional variant of the Gaussian gated Boltzmann machine (GGBM) [12], inspired by the co-occurrence matrix in traditional texture an...

متن کامل

Learning and Selecting Features Jointly with Point-wise Gated Boltzmann Machines

Unsupervised feature learning has emerged as a promising tool in learning representations from unlabeled data. However, it is still challenging to learn useful high-level features when the data contains a significant amount of irrelevant patterns. Although feature selection can be used for such complex data, it may fail when we have to build a learning system from scratch (i.e., starting from t...

متن کامل

Theta-RBM: Unfactored Gated Restricted Boltzmann Machine for Rotation-Invariant Representations

Learning invariant representations is a critical task in computer vision. In this paper, we propose the Theta-Restricted Boltzmann Machine (θ-RBM in short), which builds upon the original RBM formulation and injects the notion of rotationinvariance during the learning procedure. In contrast to previous approaches, we do not transform the training set with all possible rotations. Instead, we rot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010